Project Information

Project Description

Using multiple machine learning algorithms to generate the best predicitive regression model on the degree of solubility of a chemical compound given its molecular formula.
The training data is provided in the file listed as solubility_train.fp, this has the chemical structure in SMILES format, followed by the Solubility in log(S) values, and then the binary features representation in traditional MACCS fingerprints.